
Add Falcon3 model support #10864

Merged
merged 1 commit into from
Dec 17, 2024

Conversation

mokeddembillel
Contributor

Adding Falcon3 model support

@github-actions github-actions bot added the python python script changes label Dec 17, 2024
@ggerganov ggerganov merged commit 382bc7f into ggerganov:master Dec 17, 2024
51 checks passed
slaren added a commit that referenced this pull request Dec 17, 2024
slaren added a commit that referenced this pull request Dec 18, 2024
@slaren
Collaborator

slaren commented Dec 18, 2024

@mokeddembillel heads up, this has been reverted because the change to convert_hf_to_gguf.py was creating gguf files with broken tokenizers. This will need to be fixed before it can be added again.

@mokeddembillel
Contributor Author

@slaren @ggerganov, thanks for flagging this. Working on a fix right now.

@mokeddembillel
Contributor Author

@slaren @ggerganov Thanks again for flagging this issue.

The issue is that, when using meta-llama/Llama-3.1-8B-Instruct, the `<|begin_of_text|>` token is prepended to every special token when doing `token = tokenizer.decode(tokenizer.encode(token))`.

The screenshot shows the tokens before and after `token = tokenizer.decode(tokenizer.encode(token))`:
[screenshot]

I'm fixing this by passing `add_special_tokens=False` to `tokenizer.encode()`. Here is the result after the fix:
[screenshot]

To be extra safe, we apply `token = tokenizer.decode(tokenizer.encode(token))` only when `len(token) == 1`, so that we still fix the case where `\n` is encoded as `Ċ`.
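A minimal sketch of the fix described above, as it could look in a conversion script. The `DummyTokenizer` here is hypothetical, standing in for a Hugging Face-style tokenizer so the example runs without downloading a model; the actual change in `convert_hf_to_gguf.py` may differ in detail.

```python
def normalize_token(tokenizer, token: str) -> str:
    # Round-trip only single-character tokens (e.g. "\n" stored as "Ċ")
    # through the tokenizer. Passing add_special_tokens=False prevents
    # the BOS marker (e.g. <|begin_of_text|>) from being prepended.
    if len(token) == 1:
        return tokenizer.decode(tokenizer.encode(token, add_special_tokens=False))
    return token


class DummyTokenizer:
    """Hypothetical stand-in mimicking a tokenizer that prepends BOS by default."""

    BOS = "<|begin_of_text|>"

    def encode(self, text, add_special_tokens=True):
        ids = [10] if text in ("Ċ", "\n") else [99]
        return ([0] + ids) if add_special_tokens else ids

    def decode(self, ids):
        vocab = {0: self.BOS, 10: "\n", 99: "x"}
        return "".join(vocab[i] for i in ids)


tok = DummyTokenizer()
print(repr(normalize_token(tok, "Ċ")))  # → '\n' (no BOS prepended)
```

Without `add_special_tokens=False`, the round-trip would produce `<|begin_of_text|>\n`, which is exactly the broken-tokenizer behavior reported above.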

Generation before the fix:

Prompt: Once upon a time in a land far away,
there was a kingdom ruled by a wise and just king. The kingdom was known for its beauty and prosperity, and the people lived in peace and harmony.ĊĊOne day, a terrible drought struck the land, and the crops began to wither and die. The king, worried about the well-being of his people, called upon his wise council to find a solution. The council, after much deliberation, decided to send a group of brave knights to search for a magical spring that was said to have the power to bring rain to the kingdom.

Generation after the fix:

Prompt: Once upon a time in a land far away,
there was a kingdom ruled by a wise and just king. The kingdom was known for its beauty and prosperity, and the people lived in peace and harmony.

One day, a terrible drought struck the land, and the crops began to wither and die. The king, worried about the well-being of his people, called upon his wise council to find a solution. The council, after much deliberation, decided to send a group of brave knights to search for a magical spring that was said to have the power to bring rain to the kingdom.

Created a new PR with the fix: #10883

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Dec 20, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Dec 20, 2024